New Trends in Classification and Data Mining

نویسندگان

  • Krassimir Markov
  • Vladimir Ryazanov
  • Vitalii Velychko
  • Levon Aslanyan
  • Konstantin Rangochev
  • Maxim Goynov
  • Desislava Paneva-Marinova
  • Detelin Luchev
چکیده

The observation of the lexical structure of the Bulgarian folklore is very important task for different science domains such as folkloristic, ethnology, linguistics, computational linguistics, Bulgarian language history, etc. Until today, such a linguistic analysis hasn’t been made; it is unclear what is the lexical structure of Bulgarian folklore works. First attempt for computational lexical analysis of the Bulgarian folklore and its constituents is made during the "Knowledge Technologies for Creation of Digital Presentation and Significant Repositories of Folklore Heritage" 1. During the project the Bulgarian folklore digital library (BFDL) is designed and developed. In its structure it is implemented linguistic components, whose aim is the realization of different types of analysis of folk objects from a text media type. Thus, we lay the foundation of the linguistic analysis services in digital libraries aiding the research of kinds, number and frequency of the lexical units that constitute various folk objects. This paper presents basic types of dictionaries needed to carry out such linguistic analysis. It describes the BDFL Linguistics Search in sets of folklore objects of text media type and a linguistic component for frequency analysis of the folklore vocabulary. Finally, a project for implementation of a dictionary concordances of songs, prose, interviews, etc. is outlined.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Topic Modeling and Classification of Cyberspace Papers Using Text Mining

The global cyberspace networks provide individuals with platforms to can interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and many more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...

متن کامل

A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining

Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...

متن کامل

The application of data mining techniques in manipulated financial statement classification: The case of turkey

Predicting financially false statements to detect frauds in companies has an increasing trend in recent studies. The manipulations in financial statements can be discovered by auditors when related financial records and indicators are analyzed in depth together with the experience of auditors in order to create knowledge to develop a decision support system to classify firms. Auditors may annot...

متن کامل

S3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization

Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...

متن کامل

Customer Retention Based on the Number of Purchase: A Data Mining Approach

Purpose: this study wants to find any relationship between the numbers of purchase and the income the customer brings to the company. The attempt is to find those customers who buy more than one life insurance policy and represent the signs of good payments at the same time by the help of data mining tools. Design/ methodology/ approach: the approach of this research is to use data mining tools...

متن کامل

Emerging Trends in Associative Classification Data Mining

Utilising association rule discovery to learn classifiers in data mining is known as Associative Classification (AC). In the last decade, AC algorithms proved to be effective in devising high accurate classification systems from various types of supervised data sets. Yet, there are new emerging trends and that can further enhance the performance of current AC methods or necessitate the developm...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010